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EXPERIMENT IN LEARNING TO DISCRIMINATE FREQUENCY TRANSPOSED 
SPEECH 



Ahlstrom K.-G., Risberg A. and Lindhe V. Experiment in learning 
to discriminate frequency transposed speech. Rep. Inst, of Educ. 

No. 36 1967. Forty-four Ss were trained to discriminate between 
transposed words or syllables. Since the ^3 had normal hearing, 
it was necessary to simulate a hearing loss by Including a low-pass 
filter and white noise in the circuit for transposing, the speech 
signals. The _Ss were given ton practise sessions. At sessions 
1, 3, 5, 7, 9 and 10 they were given a test. 

The consonant sounds m.ost easily identified by the Ss at the first 
test session were /g/, /k/, /sp/ and /sk/. The sounds /d/ and /q/ 
were the most difficult to identify. 

The changes in sound discrimination which take place with training 
occur rapidly and it is doubtful whether further training would 
have had any appreciable effecb on discrimination. The _Ss learned 
to rely in their discriminations on the code dimensions voicing, 
duration and cluster consonant but not on place of articulation. 

No standpoint can be adopted as to which of the two training methods 
compared here - anticipation and recall for paired-associate lear- 
ning - is the more e*FfectiveD It does not seem to matter which 
of the two training materials - v;ords or syllables - is used. 



Background and purpose 

A hearing loss is generally less pronounced in the bass portion of the 
frequency band. When this is the case, it seems theoretically possible 
to improve speech perception by transposing the speech signals to lower 
frequencies. The approaches to this problem have involved transposing either 
the whole speech spectrum (Oeken, 1963; Pimonow, 1963) or the high- 
-frequency components of the speech signal (Kbnig-Eichler, 1954; Johans 
son, 1959; Ling-Druz, 1967). 

The ”transposer” invented by B. Johansson (Johansson, 1959) performs 
the last-mentioned operation. The block diagram in Fig. 1 shows the 
priiiTciples for this kind of transposition. 
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Block diagram of hearing aid for transposing high-frequency sounds 
to lower frequencies ("transposer"). From B. lohansson (1959, 1966) 
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Compressor ”1" normally works in its nonlimiting region and compressor 
”2” in the limiting region. The amplitude range of 20 dB is then com- 
pressed to a 3-dB range. The attack time is about 5 ms and release time 
about 50 ms. 

All speech components above the cut-off frequency of the high-pass 
filter are transposed down to the lov^- frequency region. This means, for 
instance, that /s/ will be perceived as a low frequency voiceless sibilant. 

Very few experiments have been undertaken to study the effects of 
transposition ( Wedenberg, 1959; Johansson and Sjogren, 1965; Ling and Druz, 
1967). 



The main purpose of the experiment reported here was to elucidate which 
aspects of the information in the acoustic speech signals were influenced 
by transposition. Another aim was to compare two different methods of 
training speech perception. 



Method 



Simulation of hearing loss 

Since the Ss had normal hearing, it was necessary to simulate a hearing 
loss by including a low-pass filter and noise in the equipment for trans- 
posing the speech signals. Fig. 2 contains the block diagram for the 
equipment. 



S/N = 15 d3 
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Fig. 2 , Block diagram of set-up for recording the training tapes. The 

low-pass filter has a cut off frequency of 1000 c/s and a sharp 
cut off. The signal to noise ratio is read on a RMS instrument 
at the output from the taperecorder. 
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The cut-off frequency was 1000 c/s with very sharp cut off, and the 
signal-to-noise ratio 15 db. Without low-pass filtering a S with normal 
hearing would have perceived both the transposed and the non-transposed 
speech. Now only non-transposed components below 1000 c/s and transposed 
components are perceived. The testtapes were recorded through the ’’trans- 
-poser” by H. Sjogren at the Institute for Technical Audiology, Karolinska 
Institutet, Stockholm. The subjects listened to the signal through head- 
-phones TDH 39 at a comfortable listening level. 

Design 

The Ss were 44 students at the Institute of Education in Uppsala. They 
were randomly assigned to four groups of 11 Ss each; the groups differed 
with respect to practice material and training method. 



Group 


No. of Ss 


Practice material 


Training method 


1 


10 


Words 


Anticipation 


2 


11 


Words 


Recall 


3 


9 


Syllables 


Anticipation 


4 


8 


Syllables 


Recall 



Two Ss trained at the same tim.e with the same material but used differenu 
training methods. The pairing was random. The working pace was determined 
by the S using the anticipation method. Six Ss dropped out during the 
experiment because of sickness. In these cases the _Ss who dropped out 
had been training by anticipation. The experimenter determined the 
working pace for the remaining S utilizing the recall method. The actual 
nuni)Gr of Ss is given in the table above. 
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Practice matorial 

In a pilot study 42 syllables were presented by a ReVox G36 tape recorder 
to IB Ss. The speech was transposed in the way described above. The 
syllables contained 14 different consonQnt sounds, vi » /b/, /d/, /g/, 

/k/, /t/, /h/, /f/, /^/ » /c/, /sk/ and /sp/. Each consonant sound was 
combined with one of the vowels /a/, /o/ and /i/ to form syllables like 
/ba/, /bo/, /bi/ and so on. After a syllable had been presented, the Ss 
wrote down the letters they thought corresponded to the non-transposed 

sounds. 

The conditional frequencies showing how a given transposed sound was 
interpreted are to be found in Table 1. It should be observed that each 
consonant occurred three times and each vowel 14 times per 

Table 1. The conditional frequencies (percentages) in the pilot study . 
The column x contains all other letters reported by the Ss. 



S's responses 
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On the basis of the data in this input-output matrix, two consonant sounds 
were chosen from each "sound group", viz. /d - g/, /k - t/, /s - p/ and 
/sk - sp/. By combining them with the vowels /a/ and /o/, the following 
syllables were obtained: 

da, do sa, so 

ga, go tja, tjo (pa:, pu:3 

ka, ko ska, sko 

ta, to spa, spo 

These 16 syllables were arranged in random order in a list consisting 
of 48 items. The sounds /d/ and /p/ recurred five times, /g/ and /s/ se- 
ven times and the other conson art sounds six times in the list. This list 
will be referred to as the syllable list in the following text. 

By choosing from the endings -cka, -11, -m, -rra, -rv, -ta, -v 

-va and by adding the chosen ending to the syllables, the following words 
were obtained: 



data, dom 
gat a, gom 
kail, kova 
tall, tova 



sav, socka 
tjav, tjocka 
skarv, skorra 
span/, sporra 



A word list consisting of 48 items was arranged in exactly the same v/ay 
as the syllable list. 

Both the syllable and the word lists were divided into two halves, one 
part containing item nos. 1-24 and the other item nos. 25 - 48. Two 
versions of each list were obtained by reversing the consecutive order 
of the two parts, viz. version A; nos. - 243^|^25 - 48] , and version 
item nos. [25 - 48]->|l - 24 j , 

ERIC 
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Training methods 

Two training methods were compared: the anticipation method versus the 
recall method for paired associate learning. In both cases the task of 
the S was to learn to match transposed sounds (S) with the graphical signs 
(G) for the corresponding non-transposed sounds. 

In the anticipation method a transposed syllable or word, S., is pre- 
sented to the subject, who responds, R, by trying to identify the 
graphical signs of the non-transposed syllable or word, after which 
these signs, G., are shown to him. Accordingly the sequence is as 

follows; 

>G^; S 2 >R >^ 2 ’ 

In the application of the recall method a stimulus, S^, is presented 
to the S. At the same time he is shown the syllable or word, G., 
corresponding to S.. The correct matching, G., is printed in a booklet. 

The sequence is accordingly: 

^G^^; S 2 ^^ 2 ; •••* ^G^, 

Ten practice sessions were run over two weeks. Two trials on the whole 
list of syllables or words were given in session nos. 2, 4, 6 and 6. 

In session nos. 1, 3, 5, 7, 9 and 10 only one trial was given, followed 

by a test on all the items in the list. 

The order of the items in the list was varied in each practice session 
by alternating versions A and B, regardless of whether the session was 
characterized by two trials or a trial and a test. 
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In groups land 3, which practised by the anticipation method, a teaching 
machine (PAKS 1) was used. It was designed in the Speech Transmission 
Laboratory of the Royal Institute of Technology in Stockholm. The stimuli 
were recorded on one channel of a ReVox G36 taps recorder and tone codes 
which formed response signals to each stimulus were recorded on the other 
channel. These response signals were interpreted by an electronic tone- 
-code reader connected to the tape recorder. A keyboard with five columns 
and eighb rows (40 positions) was also connected to the tone-code reader. 
In each position there was a response key and an information panel. Trans- 
parent tickets bearing the relevant test for the respective experimental 
groups were attached to the panel. Below each panel there was a lamp 
which could be switched on, in order to give information as to the correct 
alternative (cf. Fig. 3 a and 3 b). 





Fig. 3 a. Schematic diagram of keyboard adapted for training v/ith syllables. 




Fig. 3 b. Schematic diagram of keyboard adapted for training with words. 



The apparatus functioned as follows. The _S was given a stimulus from the 
tape recorder, which then stopped. The next step was to press the keys 
bearing the letters which the ^ considered to be the correct translation 
of the stimulus. After that he pressed a button marked "correct response" 
and lamps lit up under the correct keys. Let us suppose that the non” 
-transposed sounds were /ska/ and the ^ pressed the keys marked ^ and 
When he pressed the "correct response" button, the keys marked ^ and a, 
respectively, were lit up. A new stimulus was presented when the S pressed 
a button marked "next word". 

A S practising by the recall method was paced by the ^ practicing by the 
anticipation method. In other words, the latter S decided when to give 
a stimulus. The former S just listened and read the translation of the 



stimulus in a booklet. 



AssGSsment nf transfer effects, motivation and verbal fluenc: ^ 



Before the first trial and after the tenth and last practice session a 
transfer test was administered (the same in both cases). It consisted of 
25 phonetically balanced and meaningful words. The & wrote down the 
trenslaticn they coriGiderad correct. 



At the end of the fifth end the tenth sessions the Ss rated their attitude 
to the training on a scc-.ls containing seven steps from "Very boring” to 
’’Very entertaining”. In the last session a test measuring "verbal fluency” 
was r^iven. The tas!; was to write down in three minutes as many Swedish 
w.'ords as possible beginning with s and ending with a. 

The various types of msnsurcments are surmnarized below. 



nefore session 1: 
End of session 
End of session 3; 
End of session 5; 



End of session 7: 
End of session 0*. 
End of session 10: 



Pre-test: Transfer 1 
Test on practice material 
Tent on practice material 
Test on practice material 
Attitude test A5 
Test on practice material 
Test cn prccbico material 
Test on practice mateiral 
Verbal fluency 
Attitude test AlO 
Post-test: Transfer 2 



Results 

Chang e s in t he numbe r of correct responses to wor ds and syllables 

The mean numbers uf ucrrect respenses in the tests on the practice material 
are plotted against tho number of test sessions in Fig. 4. 
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No. of cor- 
rect responses 



Fig. 4. Learning curves for the experimental groups. 
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Table 2. Neans and standard deviations for the experimental groups Craw 
scores) . 



Test- 
session 1 
M S 


Test- 
session 3 
M S 


Test- 
session 5 
M S 


Test- 
session 7 
M S 


^ 

Test- 
session 9 
M S 


Test- 

session 10 
M S 


Words, anti- 
cipation 


14.4 


6.80 


23.3 


2.47 


30.1 


3.B6 


30.0 


4.37 


31.3 


4.41 


30.2 


5.50 


Words, re- 
call 


17. B 


B.65 


25.3 


5.09 


26. B 


5.47 


29.3 


5.24 


23.4 


4.54 


29.0 


4.00 


Syllables, 

anticipation 


16.7 


6.02 


24. B 


6. IB 


2B.B 


3.5B 


30.2 


4.90 


31.4 


5.59 


32.6 


3. 83 


Syllables, 

recall 


16.0 


6.27 


27.2 


4.63 


27.2 


3.69 


28.2 


4.45 


29.4 


4. 28 


29.1 


4.99 



The data surrmarized in Fig. 4 and Table 2 were subjected to analyses of 
variance. Since the number of ^s varied between B and 11 in the experi- 
mental groups, the number of ^s was reduced to B by random choice. The 
results of the analyses are given in Table 3. 



Table 3. Analysis of variance of raw scores in the learning tests. 



Source of variation 


Sums of squares 


df 


Mean squares 


F 


Between Ss 


1 890.37 


31 


60.98 




Between training methods (A-R) 


4.37 


1 


4.37 


0,07 


Between materials (W-S) 


19.37 


1 


19.37 


0.29 


Interaction (A-R) (W-S) 


26.51 


1 


26.51 


0.40 


Error between Ss 


1 840.12 


28 

» 


65.72 





I 

i 

I 

I 

i 

i 

I 










To be continued. 
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Source of variation 


Sums of squares 


df 


Mean squares 


F 


Within ^s 


7 


517.83 


160 


46.99 




Between test sessions (Ts] 


4 


715.48 


5 


943.10 


29 . 58^^^ 


Interaction (A-R] (Ts] 




177.66 


5 


35.53 


1.11 


Interaction (W-S] (Ts] 




24.56 


5 


4.93 


0.15 


Interaction (A-R] (W-S] (Ts] 




27.02 


5 


5.40 


0.17 


Error within 


4 


463.38 


140 


31.88 




TOTAL 


9 


408.20 


191 







n = 32, A-R = anticipation-recall, W-S = words-syllables, Ts = test sessions. 

Only one of the main effects is significant; there are systesriatic changes 
with practice. The mean proportions of correct responses at the first 
test session vary between 0.29 and 0.37 in the four groups. The corres- 
ponding proportions at the last test session vary between 0.60 and 0.68. 
Accordingly, the proportion of correct responses has nearly doubled at 
the end of the practice period. Fig. 4 shows, however, that the learning 
curves are rapidly approaching their asymptotic levels. In other words, 
it is probable that further practice would hove had only slight influence 
on the performances. 

Changes with practice in the identifications of consonant sounds 

Tables 4 A - 4 F give the input - output matrices (or confusion matrices] 
containing the conditional frequencies in percentages for consonant sounds 
after practice sessions 1, 3, 5, 7, 9 and 10 for groups 1 and 2, prac- 
tising words. Tables 5 A - 5 F contain the same conditional frequencies 
for groups 3 and 4, practising syllables. 




Table 4: Conditional frequencies for consonant sounds in groups 1 and 2 , 



practising words . 



Subjects' responses 





d 


g 


t 


k 


s 


tj 


sk 


sp 


X 


' 

Correct d 


V 


58 




1 






1 




20 


responses 


23 


'57 

X 


1 








1 




18 


t 


1 


3 


X 


38 


1 


1 


2 


1 


24 


k 






43 


'‘52 

X 


0.5 


0.5 


2 




2 


s 


2 


2 


6 


5 




20 


6 




30 


tj 


1 




2 




40 


'^'18 


12 


1 


26 


sk 






1 


2 


3 




'^og 

X _ 


37 


18 


sp 












— ■■■> 


16 

t 


'^Bl 


23 



d g t k s tj sk sp X 

d 

g 
t 

k 
s 

tj 
sk 
sp 

d g t k s tj sk sp X 

d 

g 
t 

k 
s 

tj 

sk 



\o 

Xj 


56 










0.5 




3.5 


33 
















2 








43 






0.5 




4.5 






43 


^54 

X 






2 




1 




3 


1 






36 






4 


1 








55 


X42 






2 






2 


1 




\ 


“^57 


38 


2 










1 




17 


'"SO 

X 


2 



'^43 


53 










1 




3 


N 

36 


''61 


1.5 












1.5 


1.5 


1 


N42 


4g 


1.5 


1 


1 




3 






47 


■'49 

x^ 




1.5 


0.5 


0.5 


1.5 


2 


1 


1 


2 


~^55 


30 


1 




8 


1 


1 






56 


" 39 






3 






2 








" 67 

V 


28 


3 














22 


\75 

\ 


2 



A. Session 1; 
After 15 min. 
practice 



B. Session 3: 
Arter 45 min. 
practice 



C. Session 5: 

After 1 hour 15 min. 
practice 



sp 



d g t k s tj sk sp X 



\5 


54 








1 

mirn 








42 


s| 






















53 
















34 


'^66 






















30 
















50 


^^50 












1.5 


1.5 






1 1 

CD 


31 


2 














m 

1 — 1 


•^84 


0.5 



d g t k s tj sk sp X 



1 

C-^'l 

Z 


4B 
















39 


^61 




















\7 


53 
















34 


CD 1 


1 








1 










^68 

-- 


31 






1 






1 




49 


^ 49 

, ... -s. 






1 






1 


1 




\ 


\70 


28 
















10 


\90 

\ 





d g t k s tj sk sp X 





47 
















45 


^5^ 
















1 




^^52 

.. . N 


47 
















31 












1 

JL 










Vo 

N 


30 
















52 


^^48 












1 










28 
















10 


'-90 


K 



D. Session 7; 

After 1 hour 45 min. 
practice 



E. Session 9: 

After 2 hours 15 min. 
practice 



F. Session 10: 
After 2.5 hours 
practice 
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Table 5: Conditional frequencies for consonant sounds in groups 3 and 4, 
practising syllables. 



Subjects' responses 



d g t k s tj sk sp X 



Correct 


d 


7 

LO 


25 


13 


11 


4 


2 


14 


9 


7 


responses 


g 


11 


49j 


10 


2 


2 


6 


7 


6 


7 




t 


3 


1 


"33. 


44 

k - 


3 


i 


3 


2 


10 




k 


3. 


5 


25 


"55. 

N 


4 


2 


2 




6 




s 


1 


7 


2 


0 

0 


23^ 


22 


10 


"7 

/ 


13 




tj 


1 


2 


5 


5 


29 


18 


18 


10 


12 




sk 






2 




2 


4 


'"59 


22 


11 




sp 


1 


•-> 

3 


1 


4 


3 


2 


40 


"^35 

\ 


11 



A. Session 1: 
After 15 min. 
practice 



d 

g 

t 

k 

s 

tj 

sk 

sp 



d g t k s tj sk sp X 



"^0 


30 


12 


5 


5 




10 


8 




11 




6 


5 


2 




2 


10 


2 


3 


1 


'^38j 


58 












0 

/L 


2 


30 


V 




1 


, 


1 


2 




3 


1 


6 


^50^ 


37 






3 






2 


3.5 


36 


Ns 

V 


3.5 


5 


5 










2 


1 


7i 

cn 

/CD 


26 


2 










1 


1 


36 


"■59 

\ 


3 1 



B. Session 3; 
After 45 Uiin. 
practice 



d 

g 

t 

k 

s 

tj 

sk 

sp 



d g t k s tj sk sp X 



N 2 


28 


2 


4 


4 


1 


1 

4 


6 




16 


/ 

cn 


11 


3 


1 




2 


1 


1 


4 


1 


" 4 ^ 


50 

.. — 












2 


1 


29 


X 

6^ 


2 








1 




1 


3 


3 


, ^ \ 


33 






5 






3 


3 


33 






4 


1 














r/ 

CD 


16 


2 








1 




1 


33 


'^64 


1 



C. Session 5: 

After 1 hour 15 min. 
practice 




d g t k s tj sk sp X 

D. Session 7: 

After 1 hour 45 min. 
practice 



d g t k s tj sk sp X 

E. Session 9: 

After 2 hours 15 min. 
practice 



d g t k s tj sk sp X 

F. Session 10: 
After 2.5 hours 
practice 



'^5 


34 


6.5 




1.5 




1.5 


1.5 




19 


^69 

-- > 


6 


1.5 


1 




1 


1 


1.5 


2 




40^ 


56 










1 


3 


3 


25 






1 




2 








4 


5 


^63 

*v 


27 


1 










3 


1 


41 


'^55 




















'^80 


20 
















34 


^6B 

\ 





"^53 


29 


7 


5 






1 


5 




19 


>1 

V 


6 


0.5 








0 


0.5 


1 




^0 

N 


49 












3 


4 


27 






.1 








1 


1 

a. 


3 


3 


^B1 

N 


27 

L 


2 


1 


1 






1 




40 


^56 


1 




2 














"V8 

'V 


22 
















25 


^75 







'^56 

V| 


T 

19 


10 


5 






5 


5 




19 


>0 

N 


3 


1.5 


1.5 


1.5 


0.5 


3 




5 


1 


V 


44 






1 






1 


2.5 


28 




1 






2.5 


1 






2 


4 


/ 

CD 


28 




0.5 


1.5 






4 




35 


'^55 






6 














^81^ 


20 












1 




34 


^65 

N 
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The mean values of the diagonals, i.e. the average numbers of correct 
identifications of a consonant sound in the matrices, are plotted against 
the nunt)er of practice sessions in Fig. 5. 



flean percentage 
of correct 
identifications 



70 

65 

60 

55 

50 

45 

40 

35 

30 




T 1 1 r 1 1 1 I ‘ 

1 3 5 7 9 10 



Testsessions 



Fig. 5 . The average numbers (percentages) of correct identifications 
of a consonant sound. 



It is evident from Tables 4 and 5 and Fig. 5 that the two materials 
are about equal in difficulty and that the identification of a consonant 
sound generally improves with practice. Some differences between the 
miatrices for the word groups and syllable groupvS should, however, be 
ccrrmented on. 



o 



The X category contains responses other than the possible eight syllables 
or words. The number of wrong responses is higher in the word groups 



cit the initial stage of learning, probably because it is easier to learn 
what the alternative syllables are than what the alternative words are. 
After session 5 the frequencies in this category are about the same for 

both materials. 

The matrices corresponding to the tests at practice level 10 show some 
small differences between the materials. A plausible explanation of this 
is that the Ss' responses to the stimuli are influenced by their relative 
frequencies in the Swedish language, i.e. that a S prefers to guess at 
the most frequent word when uncertain how to discriminate between two 
words. The words ''sparv" (sparrow) and "sporra" (spur) are, for instance, 
more frequent in Sweden than "skarv” (splice) and ’’skorra" (grate). A 
S who cannot discriminate between /sp/ and /sk/ is more apt to guess 
at /sp/ than /sk/ in the word material, where ^ is a more frequent 
response to /sk/ than ^ is to /sp/. The opposite is true among the 
syllables, v*ere /ska/ and /ska/ are meaningful words but /spa/ and 

/spo/ are not. 

Chan ges in discriminations between co nsonant Bounds 

Parallel to the changes with training in the identi ftoation of consonant 
sounds, changes may occur in the discriminations between sounds. Tools 
for quantifying the degree of discrimination in each of the input-output 
matrices are provided by information theory (of., for instance, Attneave, 
1953). The S is regarded in the following discussion as a channel with an 

input and an output of information. 
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A 

Let H(x) be the amount of information at the input or the average uncer- 
tainty per element with reference to the classification ^<. An estimate 
of H(x) in "bits" is 

1 

1 X * 

H(x) = z: log2 = log2n - - i: log2 n^. 

The corresponding estimate of the amount of information at the output 
is 



H(y) = Z pj log 2 



1 i 

••= log2n - “ Z nj log2 nj 



The estimate of the amount of information necessary to locate an clement 
in both classifications, or the joint uncertainty, is 



H (x,y) = z'^ p log^ = log 2 n - Z'^ n.^ log n^^ . 



aj 



P. . 

ij 



A measure of contingency between input and output is 



T(x;y) = H(x) + H(y) - H(x,y). 



In other words, T(x;y) measures the reduction in the amount of information 
required to locate an element in one calssification, if it has been located 
in the other. It is the amount of information transmitted from stimulus 
to response. When there is no relation whatever between the _S^s responses 

A 

and the stimuli, T(x;y) = 0 "bits". The maximum T(x;y), given a certain 

^ A 

input, is equal to H(x), It should be noted that T(x;Y) does not reflect 
the degree of correspondence with an externally arranged pairing of sti- 
muli and responses. Transmission may "... be perfect even when some or 
all responses are 'wrong' by external standards, provided the subject 
is pcn'ectly consistent in his 'errors' and maintains his own one-to-one 
correspondences between responses and stimuli" (Attnoave, 1959, p, 50), 
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A confusion matrix was establishod for each S at each level of practice 
(session nos. 1, 3, 5, 7, 9 and 10) and T(x;y) was found for each matrix. 
The means and standard deviations in the four experimental groups at 
each test session are given in Table 6. 



Table 6. Neans and standard deviations of T(x;y). n = 36. 



Test session 


Words 


Syllables 


Anticipation 


Recall 


Anticipation 


Recall 




M S 


M S 


M S 


M S 


1 

1 


1.700 0.243 


1.527 0.606 


1.427 0.B4B 


1.504 0.938 


3 


2.062 0.12B 


2,034 0,098 


1.B12 0.500 


1.7B5 0,087 


5 


1.997 0.134 


2.017 0.099 


1.973 0.6B6 


1,886 0.387 


7 


2.130 0.100 


2.056 0.093 


1.9B9 0.424 


1.976 0.663 


9 


2.171 0.137 


2.135 0.033 


2.060 0.500 


1.950 0.469 


10 


2.199 0.^20 


2.121 0.024 


1.996 0.4B0 
\ 


1.946 0.489 



Afi analysis of variance with the same eight Ss in each group as in the 
earlier analysis gave the results summarized in Table 7. 

Table 7. Analysis of variance of T(x;y) values. 



Source of variation 


Sums of squares 


df 


Mean squares 


F 


Petween Subjects 


3.665 


31 


0.118 




Between A and R 


0.171 


1 


0.171 


2.036 


Between W and S 


1.150 


1 


1.150 


13.690' 


Interaction (A-R) (W-SO 


0 


1 


0 


0 


Error variance between 


2.344 


28 


0.0B4 





Ss within groups 




(To be continued) 
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Source of variation 


Sums of squares 


df 


Nean squares 


F 


Within Ss 


11.255 


160 


0.070 




Between testsessions (TS) 


6.333 


5 


1.267 


21.584' 


1. Interaction (A-R)(TS) 


0.016 


5 


0,003 


0.051 


2. Interaction (W-S)(TS) 


0.136 


5 


0.027 


0.460 


3. Interaction (A-R) (W-S) 

(TS) 


0.042 


5 


0.042 


0.716 


4. Error variance 


8.224 


140 


0.0587 





Total 14.920 191 

n = 32 A-R = Anticipation-Recall W-S = Words-Syllables TS = Testsessions 



Only two main effects were significant (£ < 0.001); between test sessions 
and between words and syllables. Thus, the degree of discrimination in- 
creased significantly with training and the groups that trained with 
words discriminated significantly better between consonant sounds than 
the groups that trained with sylld^bles. 

A closer inspection of the matrices in Tables 4 and 5 reveals that the 
primary basis for discrimination between the consonant sounds is the mode 
of articulating tfiem. Tables 8 and 9 below are identical viith Tables 
4 F and 5 F. Here the mode of articulation is marked with a thick line, 
while the place of articulation is marked with a dashed line through the 
relevant cells. It will be seen that sounds with the same mode of arti- 
culation cannot be discriminated from each other (cf., for instance, the 
sounds /d/ and /g/), while sounds with different modes of articulation are 
almost perfectly discriminated from each other. 
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Table 6. Percentual results of the word groups in the final test. 



d 

g 

t 

k 

s 

I • 

tj 

sk 

sp 



d g t k s tj sk sp X 



53 \ 


47 


’ 














45 


\ 

55 


: 














1 




52\ 


47 
















31 


\ 

68^ 






/ 

/ 

'/ 




1 










\ 


30 
















52 


'^48 












1 


\ 






’^7l" 

N, 


28 
















10 







Table 9, Percentual results of the syllable groups in the finc?.l test. 



d 

g 



k 

s 

tj 

sk 

sp 



d g t k s tj sk sp X 







F 




1,6 




1.5 


1.5 




18 




6 


1.5 


1 




1 


1 


1.5 


2 


1 


40 

\ 


56 


\ 








1 


3 


3 


25 


X 1 
56 

\j 




1 


\ 

\ 


2 








'"na' 

\ 


5 


S3 

\ 


27 


1 










3 


1 


41 


'^55 




















'So'~ 

N 


20 ~ 
















34 


56 

\ 
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In order to relate the changes in discrimination to specific acoustical 
characteristics, presumably discriminated by the listener, we shall adopt 
the same system of articulatory features as was utilized by Miller and 
Nicely (1955) in their study of perceptual confusions among consonants 
under various conditions of noise, low-pass filtering and high-pass 
filtering. We know from their study that place of articulation and 
duration are disturbed by low-pass filtering, while voicing and nasality 
are resistant to disturbances by low-pass filtering down to about 
600 c/s. Accordingly, it would be interesting to find out if this is 
true even in transposition. 

Table 10. The code dimensions of the consonant sounds. 



Conso- 

nant 

sound 


Voiced (1) 
Voiceless (0) 


Duration 
Long (1) 
Short (0) 


Mode 

Front 

Back 


(1) 

(2) 


Clustered (1) 
Single (0) 


s 


0 


1 


1 




0 


tj 


0 


1 


2 




0 


t 


0 


0 


1 




0 


k 


0 


0 


2 




0 


d 


1 


0 


1 




0 


g 


1 


0 


2 




0 


sp 


0 


1 


1 




1 


sk 


0 


1 


2 




n 

± 




“ - 

Four sets of confusion matrices were constructed, each consisting of two 
matrices. The sets corresponded to the code dimensions. The first matrix 
in a set contained the responses at session 1 and the other matrix the 
responses at session 10. The matrices are presented in Tables 11 A and B, 
12 A and B, 13 A and B and 14 A and B. 

Table 11. Voicing. A. Relative freouen cie s in percent a; 3 ;gs at session 1. 



B, Relative frequencies in percentages at ses sion i^_. 



0 


1 


Z 




0 


1 


Z 


96.68 


3.32 


100.00 


0 


99.16 


0.84 


100.00 


32.99 


67,01 


100.00 


1 


5.40 


94.60 


100.00 

j 



Voicing. A. Voicing. B. 



I able 12, Duration. A. Relative frequencies in percentages at session 1. 
B. Relative frequencies in percenta g es at session 10. 



0 


1 


Z 


89.09 


. 10.91 


100.00 


9.76 


90.24 


100.00 



Duration. A. 



0 


1 


z 


98.56 


1.44 


100.00 


1.45 


98.55 


100.00 



Duration. B. 



Table 13. Node. A. Relative frequencies in percentages at session 1. 



B. Relative frequencies in percentages at session 10, 



1 


2 


Z 




1 


2 


Z 


46.98 


53.02 


100.00 


1 


63.35 


36.65 


100.00 


41.97 


58.03 


100.00 


2 


34.86 


65.14 


100.00 



er|c 



I 

i 



1 



1 



i 



1 



I : 
-Hi 



Node. A. 



Node. B. 
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Table 14. Cluster consonants. A. Relative frequencies in percentages at 

session 1. B. Relative frequencies in percentages at session 10. 



0 1 2 0 1 



89.58 


10.42 


100.00 


0 


99.33 


0 67 


100.00 


6.31 


91.69 


100.00 


1 


0.25 


99.75 


100.00 



Cluster consonants. A. Cluster consonants. B. 



At the first session the voiceless consonant sounds (0) vvere percen/ed 
as voiceless in 96.68 per cent of all cases and as voiced in 3.32 per cent. 
After practice the corresponding figures were 99.16 per cent and 0.84 
per cent respectively. In other words, practice sharpened discrimination. 
If we consider the voiced sounds (1), the changes are even greater. 

Clear~cut changes with training are found in all code dimensions, but 
the Ss" ability to discriminate on the basis of place of articulation 
is poor, even after ten sessions of training. 

The improvement in transmission of duration found here may be due to the 
frequency transposition and may be also to the amplitude comipression. 

Transfer effects 

A test consisting of phonetically balanced words was administered before 
the first trial and after the last trial. The mean performance at the 
pre test was M = 5.0 and at the post-test M = 7.5. In order to study if 
all four groups werj characterized by the sam.e increments, an analysis of 
covariance was run partiallying the pre-test scores from the post-test 
scores. The product moment correlation between pre-test and post-test in 
the pooled groups is 0.59. The results will be found in Table 15, 
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Table 15, Analysis of covariance of transfer data. Pre-test kept cons- 



tant . 

Source of variation Sums of squares df Mean squares F 



Between groups 


1.27 


3 


0.42 




Within groups 


159.40 


33 


4.03 




Total 


160.75 


36 







The differences between the groups, as regards changes in the transfer 
test, with practice, are not significant. Accordingly, it is impossible 
to tell if the overall change in mean perfonnance is an effect of real 
transfer. It may also be a result of adjustment to the testing situation, 
for instance, a warming-up effect. 

The product moment correlation for the pooled groups between the two 
attitude effects is 0.60. The correlation between the attitude test 
and the number of correct responses at session 5 is r = 0.09 and at 
session 10 r = 0. 16. 

Attitudes to training 

All the ^s rated their attitudes to training at the end of sessions 5 and 
10. The differences between the mean values in the four groups on each 
occasion were tested by analysis of variance. The results are presented 
in Tables 16 and 17. 
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Table 16. Analysis of 


variance of 


attitude-test data 


at session 5. 




Source of variation 


Sums of 


squares 


df 


Mean squares 


F 


Between groups 


17.98 




3 


5.99 


12 . 22 XX 


Within groups 


16.32 




33 


0.49 




Total 


34.30 




36 







The differences are significant (p < 0.01). By pairvjise comparison of 
each group mean with every other and ;t-testing the differences, it 
was found that the attitude levels were highest under the condition of 
anticipation of syllables and lowest under the condition of recall of 
words . 



Table 17. Analysis of 


variance of attitude- 


-test data 


at session 1C). 


Source of variation 


Sums of squares 


df 


Mean squares F 


Between groups 


7,31 


3 


2.44 1.76 


Within groups 


45.19 


33 


1.37 


Total 


52.5 


36 





In this case no differences between groups could be found. The product 
moment correlation between the two attitude measurements Is 0.60 in the 
pooled groups. The correlations between the attitude test and the number 
of correct responses at session 5 and session 10 are 0.09 and 0.16 res- 
pectively. Since none are significantly greater than r = 0, it is not 
probable that the ^s^ attitudes have influenced the performances. 
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Verbal fluency 

The verbal-fluency task consisted in writing down in three minutes as 
many Swedish words as possible beginning with _s and ending with a. 

The correlations between this test and a sample of the other variables 
are reported in Table IB. 



Table IB. Correlations between verbal fluency and other variables. 





Attitude 


Transfer 


Practice 




5 


10 


1 


10 


1 


5 


10 


Verbal 

fluency 


0.15 


0.20 


0.2B 


0.42^^ 


-36'^ 


0.04 


0.16 



Verbal fluency is positively correlated (ji < 0.01) with the post-test 
on transfer. A slight positive relation is also noticeable at the pre- 
-test. The reason for this may be that Ss with high verbal fluencies are 
more able to utilize the perceived cues for guessing according to 
the statistical structure of S^vedish words. It may seem puzzling, how- 
ever, why verbal fluency is negatively related to the number of 
correct responses at the first session of training. 

The reasons may be that the Ss were not informed at test session 1 that 
the test stimuli were the same as the stimuli just presented in the 
preceding trial. Accordingly, Ss with high verbal fluencies may have 
relied on the same guessing habits as in the transfer test. If this were 
the case, they would be mislead, because the statistical structure of 
the syllables and words is not the same as the structure of Swedish words 
in general. 
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Sunrmary and discussion 



The consonant sounds most easily identified by the _Ss at the first test 
session were /g/, /k/, /sp/ and /sk/. The sounds /d/ and /c/ were the 
most difficult to Identify. This is demonstrated in Fig. 6, in which the 
increases and decreases in the relative frequencies of identifications 
from session 1 to session 10 are plotted against the frequency with 
which a given sound was identified at session 1. 



Increases 
and decreases 
from sessi^ih 
1 to session 
10 . 

50 
40 

30 - 
20 - 
10 

0 4 - 
-10 




10 20 30 40 50 60 70 



Word groups 
Syllable groups 



Relative frequencies of identification 
at session 1. 



Fig. 6. Effects of training on identifications of consonant sounds. 

The greatest improvements with practice in relation to the maximum 
possible gain are found in the sounds /spA /sk/ and /s/. 




The changes in sound discrimination which take place with training occur 
rapidly and it is doubtful whether further training would have had any 
appreciable effect on discrimination. The initial value of the amount 
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of information transmitted is in the range 1.4 - 1.7 "bits”. The primary 
reason for this is probably that the Ss had normal hearing. A S with 
normal hearing is able to recognize the low-frequency sounds which have 
not been transposed, and it is possible that he can shift his reference 
system when the frequency range of the fricatives is shifted. For a per- 
son with serious impaired hearing since birth, the initial amount of 
transmitted information would be close 0 "bits", because he has no 
reference system and thus all sounds are nav to him. 

An interesting point is that the ^s could learn to rely in their 
discriminations on the code dimensions voicing, duration and cluster 
consonant but not on place of articulation. A training experiment should 
be performed in which transposition with and without compression is com- 
pared with low-pass filtering, in order to record more specifically the 
factors which determine the perception of this code dimension. 

No standpoint can be adopted as to which of the two training methods - 
anticipation and recall - compared here is the more effective. Anti- 
cipation seems to b. somewhat - but insignificantly - more effective 
Ccf. Fig. 4). Other studies v^iith paired-associate materials show that the 
anticipation method is more effective when the stimuli are difficult to 
code, and the recall method is more effective when the number nf respon- 
ses exceed the memory span of the S CBarch and Levine, 1967). In other 
words, the outcome of a comparison between the two training methods may 
depend on the number of stimuli, the type of stimuli, the cut-off frequency 
of the low-pass filter and, in the case of Ss with impaired hearing, 
the degree of hearing impairment and linguistic competence. 



- 33 - 



It does not seem to matter which of the two training materials - words 
or syllables - is usedi But syllables appear to be somewhat " though in- 
significantly - easier to learn. 

The results of the training sessions vary with the ^"s degree of verbal 
fluency. A S with high scores in the S-A test tends to get high scores 
in the training sessions. In future experiments different verbal abilities 
could be combined in order to find out which of them are relevant for this 
type of training experiment. 



o 
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